128 PART 3 Getting Down and Dirty with Data
Software can provide various enhancements to the basic box plot. Figure 9-8b
illustrates two such embellishments you may consider using:»
» Variable width: The widths of the bars can be scaled to indicate the relative
size of each group.»
» Notches: The box can have notches that indicate the uncertainty in the
estimation of the median. If two groups have non-overlapping notches, they
probably have significantly different medians.
Depicting the relationships between
numerical variables with other graphs
We started this chapter by developing summary statistics and making graphs of
one numeric variable at a time. One example was where we took seven measure-
ments of diastolic blood pressure (DBP) from a group of study participants and
developed summary statistics. This is called a univariate analysis because it only
concerns one variable. But in the example of box plots in the preceding section, we
conducted a bivariate analysis because we were looking at the relationship between
two variables in a sample of patients from four different clinics. The two variables
were enzyme levels, and source clinic (Clinic A, B, C, or D). We could have done
another bivariate analysis looking at two continuous variables (such as two differ-
ent enzyme levels in participants) using a scatter plot, which is covered thor-
oughly in Chapter 16.
This chapter focused on univariate and bivariate summary statistics and graphs
that can be developed to help you and others better understand your data. But
many research questions are actually answered using multivariate analysis, which
allows for the control of confounders. Being able to control for confounders is one
of the main reasons biostatisticians opt for regression analysis, which we describe
in Part 5 and Chapter 23. In these chapters, we cover the appropriate summary
statistics and graphical techniques for showing relationships between variables
when setting up multivariate regression models.